IRC-CLVul: Cross-Programming-Language Vulnerability Detection with Intermediate Representations and Combined Features

نویسندگان

چکیده

The most severe problem in cross-programming languages is feature extraction due to different tokens programming languages. To solve this problem, we propose a cross-programming-language vulnerability detection method paper, IRC-CLVul, based on intermediate representation and combined features. Specifically, first converted programs into unified LLVM (LLVM-IR) provide classification basis for Afterwards, extracted the code sequences control flow graphs of samples, used semantic model extract program information graph structure information, concatenated them vectors. Finally, Random Forest learn vectors obtained results. We conducted experiments 85,811 samples from Juliet test suite C, C++, Java. results show that our improved accuracy by 7% compared with two baseline algorithms, F1 score showed 12% increase.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

(CLSCR) Cross Language Source Code Reuse Detection Using Intermediate Language

In today's digital era information access is just a click away. so computer science students also have easy access to all the source codes from different websites thus it has become difficult for academicians to detect source code reuse in students programming assignments. The new trend in the area of source code reuse is using the source code by translating it in another programming language p...

متن کامل

Intermediate integer programming representations using value disjunctions

We introduce a general technique to create an extended formulation of a mixed-integer program. We classify the integer variables into blocks, each of which generates a finite set of vector values. The extended formulation is constructed by creating a new binary variable for each generated value. Initial experiments show that the extended formulation can have a more compact complete description ...

متن کامل

Programming Language Features for Refinement

module A { type T function F(x: T): T function Twice(x: T): T { F(F(x)) } } module B refines A { type T = T’ datatype T’ = Leaf(int) | Node(T, T) function F... { match x case Leaf(w) => Leaf(w+1) case Node(left, right) => Node(F(left), F(right)) } } Figure 0. Two example modules, one (B) declared as a refinement of the other (A). • Refine previously given specifications (for example, strengthen...

متن کامل

English Event Detection With Translated Language Features

We propose novel radical features from automatic translation for event extraction. Event detection is a complex language processing task for which it is expensive to collect training data, making generalisation challenging. We derive meaningful subword features from automatic translations into target language. Results suggest this method is particularly useful when using languages with writing ...

متن کامل

Cross-Language Prominence Detection

We explore the ability to perform automatic prosodic analysis in one language using models trained on another. If we are successful, we should be able to identify prosodic elements in a language for which little or no prosodically labeled training data is available, using models trained on a language for which such training data exists. Given the laborious nature of manual prosodic annotation, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12143067